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HANKEL OPERATORS IN SEVERAL COMPLEX VARIABLES AND 

PRODUCT BMO 

MICHAEL LACEY AND ERIN TERWILLEGER 

Abstract. H 2 ((g)™C+) denotes the Hardy space of square integrable functions analytic in each 
variable separately. Let P e be the natural projection of L 2 ((g>™C+) onto H 2 (®"C+). A Hankcl 
operator with symbol b is the linear operator from // 2 (<g>™C+) to H 2 ((g>™C-(-) given by H;, ip = 
P Q bifi. We show that 

||Hj,|| ~ \\P®b\\BMO{®fC + )' 
where the right hand norm is S.-Y. Chang and R. Fefferman product BMO. This fact has well 
known equivalences in terms of commutators and the weak factorization of H 1 (®"C+). In the 
case of two complex variables, this is due to Ferguson and Lacey [8]. While the current proof is 
inductive, and one can take the one complex variable case as the basis step, it is heavily influenced 
by the methods of Ferguson and Lacey. The induction is carried out with a particular form of a 
lemma due to Journe [10], which occurs implicitly in the work of J. Pipher [13]. 



1. Introduction 

We characterize the boundedness of Hankel operators in three and more complex variables in 
terms of the BMO norm of the symbol of the operator. In one complex variable, this and related 
facts are the circle of ideas around Nehari's theorem. In the case of more than one complex variable, 
there are different types of Hankcl operators, and we only consider the so called little Hankel 
operators; little in the sense that the projection used in the definition is onto the smallest natural 
choice of subspaces of L 2 (R") to use. The structure of these Hankel operators is more intricate due 
to the more complicated structure of the Hardy spaces H 1 in the product domain and their duals, 
as identified by S.-Y. Chang and R. Fefferman [[1],[2], [3]]. Some of the tools that have proved to 
be so flexible and powerful in the one parameter situation apparently have no analog in the higher 
parameter case; these spaces remain, to a significant degree, poorly understood. 

We prove the natural statement about the boundedness of little Hankel operators in an arbi- 
trary number of complex variables. Namely, the Hankel operator with symbol b is bounded iff the 
projection of b into product Hardy space is in product BM O. Central to this paper is the result 
of S. Ferguson and M. Lacey [8] that established a similar characterization for Hankel operators of 
two complex variables. The current proof is inductive in nature, and one can use the classical one 
variable statements of our theorem as the base case in the induction. In particular the methods of 
[8] are not sufficient to prove the Theorem in this paper; the inductive argument is the essential 
new argument in this paper. 

Recall that L 2 (M) has the orthogonal decomposition H 2 (R)oH 2 (M) . Let P be the corresponding 
orthogonal projections onto the analytic/antianalytic spaces. 
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In n variables, let Pp be the same projections acting on the j'th coordinate, j € {1,2,..., n}. 
For functions a : {1, 2, . . . , n} — > let 

n 

pCT = n p j ° ,} • 

It is clear that L 2 (R") has the orthogonal decomposition into 

L 2 (W n ) = ®a P a L 2 (W n ). 

We take © to be the function from {1,2,..., n} that is identically +. It is clear that P® L 2 (W l ) = 
H 2 (^C+). 

For a function &, we set the Hankel operator with symbol b to be Hb / = P e bf, defined as a map 
from H 2 (^C+) to H 2 (^C+). P e is the projection from L 2 {W l ) onto H 2 (^C+). Clearly, this 
operator depends only on P® b. 

Theorem 1.1. We have the equivalence of norms 

(1-2) ||H fa || ~||P ffi 6|| BMO (^c + )- 

Here BMO(®"C+) is the (analytic) Bounded Mean Oscillation space, dual to 7J 1 (£3>™C+), as 
identified by S.-Y. Chang and R. Fefferman. This theorem has two well known equivalences. One 
is in terms of the commutator 

C b := [••■[[M 6 ,Hi],H 2 ],--- ,H„], 

in which M& is the operator of pointwise multiplication by &, and IF, denotes the Hilbert transform 
computed in the jth coordinate. The commutator is a sum of 2 ra Hankel operators, each coming 
from one of the 2" orthants of M". In particular, if the signature of a is sgn(er) = Ilj=i a ti)> a 
straightforward computation shows that Cb = — 2" s g n ( cr ) P _<T Mb P CT ■ Thus, the upper bound 
for the Hankel operators \\Hb\\ ^$ ||&||bmo immediately extends to an L 2 operator norm for the 
commutators. Conversely, assuming the commutator is bounded on L 2 (£g>"C+), a number of Hankel 
operators with the same symbol are also bounded. Namely, the Hankel operators arc from P CT L 2 to 
P~ a L 2 . Fhus the lower bound follows. That is, we have 

\\Cbh — \\ b \\BMO(®?C + )- 

The latter space is the real BMO space. 

A second equivalence is in essence a dual statement to the estimates above, and hence is a 
statement about iT 1 (cg>iC+). It gives us a weak factorization result for that space, namely 

(1.3) # 1 (® ? 1 l C+) = H 2 (^C + )QH 2 (^C+), 

where the right hand side is the projective tensor product of H 2 . This equality plays a role in our 
proof, and so we return to it below. 

For the proof, the strategy is one of induction on the number of parameters in a manner analogous 
to the overall strategy of Ferguson and Lacey [8], which addresses the two parameter case. The 
upper bound on ||Hf,|| in (1.2) is in fact easy to obtain, a fact which is easiest to see via the trivial 
inclusion in (1.3). Thus, the real difficulty in our theorem lies in the lower bound on ||Ht,||. Here, 
there is a bound which follows from the one parameter theory, namely that the operator norm of 
Hf, is bounded below by the "rectangular BMO" norm of b. It is well known that the rectangular 
BMO norm is essentially smaller than the BMO norm. Ferguson and Lacey [8] showed how to use 
the Journe Lemma [10] to pass from this essentially smaller norm to the BMO norm of Chang and 
Fefferman. 

A formulation of the Journe Lemma for rectangles in three and higher parameters is due to 
J. Pipher [13], but the direct application of this lemma cannot succeed in a proof of our theorem. 
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The reasons are both technical and heuristic. Relying on just the rectangular BMO norm in three 
and more parameters does not take advantage of the subtle way that the n parameter BM O space 
is built up from the n — 1 parameter space. 

We find that this point of view, and a form of Journe's Lemma we need, as stated in Section 6, 
are implicit in the paper of J. Pipher. To use the Journe Lemma, we need to make a definition of 
BMO-i(®"C+), which is applied to a function in the n parameter setting. Our induction argument 
then, in proving the lower bound in the n parameter setting, is to first derive the weaker bound 
of BMO-i(8fC + ). And then prove the correct BMO bound, assuming that the BAfO_i(<g) r 1 l C+) 
norm of the symbol is sufficiently small. 

By A < B we mean that there is an absolute constant K for which A < KB. K is allowed to 
depend upon relevant parameters. 

We are indebted to J. Pipher for sharing some of her insights into the Journe Lemma, and to the 
referee for a quick and helpful report. 



2. The Upper Bound 

The upper bound ||Hj,||2 < ||^||bmo((»5 1 C + ) can be seen by a soft proof. Consider the Hankcl 
operator Hf,, 



|H 6 || = sup f(P e bf)gdx 



ll/l|2=i,||g||2=i 



(2.1) = sup P^bfgdx. 

||/||a=l,||»||2=l 

Since the product of H 2 functions is in H , we see that the integral above admits the upper bound 
of ||P®&|| 

bmo(®?C+)- This is the upper half of Theorem 1.1. 
We turn to the weak factorization result. For A, B closed subspaces of L 2 (R"), we define the 
projective tensor product A&B C L 1 (M n ) by 

oo oo 

AQB := {h = fm | (fj) Q A, ( 9j ) C B, and ^||/,|| 2 ||.9,|| 2 < oo}. 
i=i j=i 

Observe that that (2.1) implies that the Hankel operator with symbol b is bounded if and only 
if the function P b is in the dual of H 2 (^ , IC + )QH 2 (^C + ). Therefore, the weak factorization 
equivalence (1.3) is equivalent to our main theorem. That is, we have the equivalence 



|H 6 || c \\b\\ (H , 



(ff 2 (®fC + )0i/ 2 (®5*c+))* 



3. Wavelets, BMO(®?C+) and BMO- 1 {® r {<C + ) 

We begin with some preliminary definitions and calculations in the one parameter setting which 
carry over naturally to the higher parameter setting. The proofs in the rest of the paper use analytic 
wavelets constructed by Y. Meyer [11] which are compact in frequency. Let w be a Schwartz function 
with \\w\\2 = 1 and «5(£) supported on [2/3,8/3]. Therefore the wavelets and projections have the 
nice decay estimates 

< {1 + \x\)- n forn>l. 
Let T> denote the dyadic intervals on KL For an interval I € T>, define 
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where c{I) denotes the center of /. Note that the functions wj{x) are well localized to the interval 
/. Indeed, 

i i/l + dist(x,/)\-« 
\wi{x)\<\I\ ^ |jp -J forn>l. 

Y. Meyer has shown that we can choose w so that {u>/} r g x> form an orthonormal basis on iJ 2 (<g>™C+). 
Another useful property of these functions is that we have the Littlewood-Paley inequalities, 

I0>/) 12 



1 < p < oo. 



I 1 lev 1 1 

We are now ready to define a characterization of product BMO(®iC+) due to S.- Y. Chang and 
R. Fefferman [2]. Let 1Z = T> n = g)™ =1 X> be the dyadic rectangles. For a rectangle R = &j =1 Rj G 1Z, 
define 

n 

v n(x) ■= Y[w Rj (xj). 

3 = 1 

We say / G BMO(^C+) iff 



sup 

u 



wr 1 J2 K/.«fl)i 2 



< oo 



flee/ 

where {/ is an open set in R" of finite measure. 1 We denote this supremum by ||/||bmo(®jc+)- It 
is a theorem of Chang and Fefferman that this definition coincides with the norm of the dual to 

We now define a weaker notion, which we dub BMO-\{®i'C+). For a collection of rectangles 
Wc7l, set the shadow oili, to be 

sh(W) := (J R. 
Rett 

We say that IA has n — 1 parameters iff there is a coordinate 1 < /c < n and a dyadic interval / so 
that for all R € U, we have = /. We then define 

\\b\\BMO- 1 (®?c + ) = sup |sh(W)| _1 ^ |(6,w fl )| 2 

n — 1 parameters'- RelA 

Here, we note that the definition depends only upon the projection P® b. In two dimensions, this 
reduces to a notion that is just slightly weaker than the notion of rectangular BA10, which is well 
known to be essentially smaller than the BMO norm. 

4. The BMO^i{® 1 l'C + ) lower bound 

An essential part of the argument is to use the induction hypothesis to show that we have the 
lower bound 

l|H b !| > ||P e 6!| B MO- l( ^c + )- 

This amounts to the assertion that 

(4- 1 ) \\b\\(Hi(®™c + )om(®™c + ))* ~ \\ b \\BMO- 1 (^c+), 

an inequality we will demonstrate by relying on the truth of Theorem 1.1 in the n — 1 parameter 
setting. 

Given a symbol b = b(x\,X2, ■ ■ ■ ,x n ) = b(xx,x') of n variables, we assume that b is analytic in 
all variables and has |j &|| j ba/o_i(<s>^C+) = 1- We also take as given a set U of rectangles in T> n of 



^This is analytic BMO, as we are using analytic wavelets. Real BMO has a similar definition, provided one uses 
wavelets that form a basis for L 2 (R). 
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ri — l parameters. Thus associated to U are the dyadic interval / and the collection US n x > C T> n 1 
as in the definition. We assume that |/| = 1, |sh(Z/)| ~ 1, and for all R € U we have R\ = I and 
R = I x R'. 

Our claim is then that the function 

REU 

has i7 2 ((K)"C+)0iJ 2 ((8)"C + ) norm < 1, which, together with (ip,b) = 1, certainly proves (4.1). 
Thus it suffices to show the claim. 

Now, since for each R € U, we have vr(x±, x') = wi{x\)vri{x'), and 

i/)(xi,x') = Wi(xi) ^ (b, Vr)vr< (x') :=wi(xi)ij/(x'), 

REU 

we can utilize factorization results in both x\ and x' . For x\, we use the classical inner outer 
factorization to conclude that 

wj = wf ] wf\ ||4 1) H^(c + )lkf ) ll^(c + ) < 1. 

Concerning the function ip' , by our choice of U and the square function characterization of the Hardy 
space, we observe that ip' has H 1 {<S^~l'C + ) norm at most a constant. By the induction hypothesis 
and, in particular, the assertion that H 1 {® , -Zl<C + ) = H 2 (®"=iC + )0iJ 2 ^(<8>™r l 1 C + ) , we can write 

V»'0O = ^2M x ')<t>kW), Z]II^II^(®7- 1 i c + )II^IIh=(®7- 1 i c + ) ~ L 

k k 

Hence, writing 

k 

we see that our claim holds. This completes the proof of the lower bound. 

5. The BMO lower bound 

An example of Carleson shows that the BMO-i(®"C+) bound is essentially smaller than the 
BM O norm. In particular, some tool is needed to pass to the larger norm. That tool is a Journe 
Lemma, which we detail in the next section. 

We can assume that ||6||bmo(®jc + ) = 1 an d seek an absolute lower bound on ||H&|| . In the course 
of the proof, we will need absolute positive constants 5—i, Sj ourn ^, 5%, and 63. Other parameters, 
termed "diagonalization parameters" , are introduced to gain convergent geometric series. The 
parameters used for these will be denoted with the letter d with various subscripts. We assume that 
IHI_BAfO_i((8"C + ) < <5-i? f° r otherwise we have an absolute lower bound on 1 1 H5 1 . 

Take a set of rectangles U which achieves the supremum in the definition of the BMO norm of 
B. We can assume, after a harmless dilation, that | < |sh(W)| < 1. We will show that 

(5.1) ||H 6 Pp]6|| 2 ><5 3 . 

Here we use the notation P[U] = ^Z r&u vr ® vr, and so P[U]b = J2reu(^^ v r) v R- Establishing the 
lower bound on the norm of the Hankel operator will require some careful analysis which centers 
around a variety of paraproducts, proper formulation, and application of a lemma due to Journe 
which is specified in Section 6. 

From the discussion in Section 6, there is a set V D sh(U), satisfying several conditions, among 
them 

journe 

)\sh(U)\. 



(i 
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Take the collection of rectangles V and W to be 

V := {R e K | R C V, R(t sh(W)}, 

w :=n-u- V. 

We shall prove that for absolute #2 > 0, 



(5.2) ||P e P[W]6P[W]6|| 2 > <5 2 , 

5 1/2 . 

journc ' 



(5.3) \\P e P[V}bP[U}b\\ 2 <S^ 2 



(5-4) ||P° P[W]bP[U]b\\ 2 < K S}omt S-i. 

The last inequality holds with a constant that depends only on <5j urnc- Thus, fixing first <5j urne 
sufficiently small and then S—i proves (5.1). 

The first two estimates are trivial, as we indicate now. First, note that the Fourier transform of 
|P[t/]6| 2 is symmetric, so that 



\\P e P[U]bP[U}b\\ 2 > 2-™||P[W]6||l. 

The L 4 norm has a lower bound, due to the fact that we have taken the shadow of U to have measure 
approximately one and the validity of the Littlcwood-Paley inequalities. Thus, 

ii \{b,v a )\* iV2 

R&A 1 1 



< 



\P[U]b\\ 



This proves (5.2). 

Second, use the control on the size of V to see that 



\P[U]b\\i + \\nV}b\\l = + \\P[V]b\\ 2 2 < \V\ < (1 + <Wnc)|sh(W)|. 

! < ^journc- By t 

we can prove (5.3) as follows. 



Thus, ||P[V]6||2 < journc- By the John-Nirenberg inequality, we see that ||P[V]6||4 < ^jomnc- Hence, 



'p[v}bpp}bh < \\mb\u\\p[v}b\u<s^ n6 . 



The Definitions of the Paraproducts. The principal inequality is (5.4), and it requires a sus- 
tained analysis to verify. It is imperative to observe that the term Hp[ W ]t,P[W]6 has a sizable 
cancellation as a sum over wavelets. If R and R 1 are two dyadic rectangles with \Rj\ < 8\R'j\ for 
any 1 < j < n, then we would have 

P e Vr.vr- = 0. 

This is due to the fact that in the jth coordinate, the Fourier transform is not supported in £j < 0. 
Thus, we can replace the definition of W by: 

W = {R' e K | R' (t V, and for some ReU, \Rj\ < 8\Rj\ for all 1 < j < n}. 

It is also imperative to observe that even with this restricted definition, the shadows of U and 
W will, in general, overlap. This overlap will be controlled by the Journe Lemma and additional 
orthogonality considerations. 

Nevertheless, the sum should be analyzed along the lines of a product of two functions which 
are nearly supported on disjoint sets. The technique for doing this is via sums known generically 
as paraproducts. In n parameters, the paraproducts admit different degeneracies, as measured in 
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the amount of orthogonality present in the sums. It is the purpose of the following definitions to 
quantify these paraproducts. 

Given a subset J C {1,2, ... ,n}, write R' -<j R iff for indices j £ J, we have 8\R'j\ < 
whereas for indices j £ {1,2, . . . ,n} - J, we have 8 -1 |.Rj-| < \Rj \ < 8|i^-|. Set 

X(J) := {(R', R)£W xU\R' <j R} 

M J ) '■= (b,V R /)v R /(b,V R )v R . 
(R',R)GX(J) 

The remainder of the proof is devoted to the assertion that 

(5-5) \\W)h < Kt ioutnS S-i, JC {1,2,..., n}. 

This objective can only be met with additional diagonalizations of the sums. Applying the Journe 
Lemma as stated in Lemma 6.10, we can decompose U into collections ti^, for d\ £ N, for which 
we have 2 dl Rc V for R£U dl , and 

(5.6) \\nU dl ]b\\BMO < A' 5journ ,2("+ 1 ) dl ||6|| BMO _ l(8?c+) < 2^ +1 ^S^. 

In what follows, we shall suppress the dependence of these inequalities on the choice of 5j OU rnc, 
which comes only through this application of Journe's Lemma. Also the (large) power of d± is of 
no particular consequence. From another part of the estimate we can pick up a factor of K^2~ 1 
for arbitrarily large N . 
For integers d 2 >d\, set 

X(J, d 2 ) := {{R 1 , R) £ W x U dl | R' <j R, R' C 2 d2+4 R, R' £ 2 d2 R} 
X(J,d 2 ):= ^ (b,VR')vR'(b,v R )v R . 

(R',R)eX(J,d 2 ) 

In this notation, and below, we will suppress the dependence upon d±, as this parameter does not 
directly enter into any of the estimates. We shall show that 

(5.7) ||X(J,d 2 )|| 2 <2- d2 <5_i, JC {l,2,...,n}, < d 1 < d 2 . 

This estimate proves (5.5). 

Orthogonality enters into the estimate in the following way. Suppose we are given two pairs of 
rectangles (R,R') and (R,R') in X(J). In addition, suppose 16|i?^| < \R'j\ for some j £ J. We 
conclude that the functions v R iVr and v Rr Vfi are orthogonal. This is seen by examining the Fourier 
supports of the wavelets. Therefore, for |J|-tuples of integers I £ Z> J >, we define 

X(J,d 2 ,7) := {(R',R) £ X(J,d 2 ) | |^| = 2 7 ^, j £ J}. 

X(J,d 2 ,£):= ^2 (b,v R >)v R/ (b,v R )v R . 

(R',R)ex(J,d 2 ,~t) 

Here, for simplicity, we have assumed that J = {1, 2, . . . , |J|} for notational convenience. We will 
continue with this assumption throughout. All estimates will clearly be invariant under appropriate 
permutation of coordinates. In light of the orthogonality above, it is the case that 

(5.8) \\X(J,d 2 )\\ 2 2 < J2 \\nJ,d2,~t)\\l 

At this point, we can abandon orthogonality considerations altogether in estimating this last 
sum. In the sums X(J, d 2 , £ ) we have the functions i>.r'Vr, which can be dominated as 

[|i?'||i?|] 1/2 K'«fl| < [(£*' * Ih'XCh * i«)] 2 
< M1 r {c(R')) n Cr'*1w, 
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where we take 

C*(a:) := [1 + \xil\Ri\~ 1 + ■■■ + KIKr 1 ]" 100 "- 

Here, N > 1 is arbitrary, though the implied constant will depend upon the choice of N. M 1r(c(R')) 
is an effective measure of the distance between R' and R, as R' will always have dimensions which 
are smaller or comparable to those of R. 

Of course, for (R 1 , R) G X(J, d 2 ), we have M l R (c(R')) < 2~ Nd2 . Thus, 

||X(J,d 2 ,T)|| 2 <2- Ard2 ||X(J,rf 2 ,7 > )|| 2 , where 
X(J,d 2 ,7):= Y flgj/W lRI , an d 

/3(i?) :=|<M«>I- 

The top line holds for all large integers iV. Thus in the argument below we can accrue some bounded 
number of positive powers of 2 d2 and not place our desired estimate in jeopardy. Hence, to obtain 
(5.7) it is enough for us to show that 

(5.9) l|X(J,d 2 ,t)||2<2 8 " d2 ^ 1; J C {l,2,...,n}. 

As the terms X are sums of indicator sets of rectangles, we can appeal to facts about Carleson 
measures and, in particular, the John-Nirenbcrg inequalities to control these sums. 

There is a final diagonalization to make. For | J|-tuples of natural numbers d% G N' 17 ', set 

x(j,d 2 ,7,ck) ■= {(R',R) g x(j,d 2 ,7) | 2*y|ity = \Rj\, j g J}, 

{R',R)&X(JA 2 , I ,da) v ' 

= 2 -iiiaii v m ' )m i„. 

^ \R'\ R ' 

(R',R)&X{J,d 2 , I ,ds) 

where ||d3|| := X^=iM3jl- The leading term of the last line suggests that indeed ds is a diagonal- 
ization parameter. 

With the relative sizes of R' and R fixed by the choice of J C {1, 2, . . . , n} and by the choice of 
^3, observe that for each R', there can be at most 0(2 nd2 ) possible choices of R so that (R,R') G 
X(J, d 2 , I , d 3 ). Again, we can afford to lose some bounded number of powers of 2 d ' 2 in our estimates. 
We take 

y(J, d 2 ,~t, da) ■= {R' I {R', R) G X(J, d 2 ,~t, ck) for some R G TZ}, 

y(j,ckX)-= (J y(J,d 2 ,7,T 3 )- 

Let 7r : y(J, d 2 , £ , efa) — ► TZ be such that (i?', tt(R')) G Af(J, d 2 , £ , d 3 ). We would need to consider 
0(2 nd2 ) possible choices for this function. Below, we will consider just some arbitrary choice of this 
function it, and then merely sum over the possible choices of tt, accruing a harmless term of 0{2 nd2 ) . 
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Set 



' = 2 -iii^ii v P(R'M<R')) 1 , 

ip/i R ' 



\R' 

R'ey(J,d 2 , i ,d 3 ) 



The specific estimate we prove is: 



\\Y(J,d 2 ,t)\\l-= E \\W,d2,~tX)\\l 

(5.11) tezi J i 

<2 8 ™ d2 " iU^II^.!, Jc {1,2,..., n}. 

This is summed over ^3 G N'" 7 ' to prove (5.9), and so will complete our proof. The proof of this 
inequality is taken up in the next subsection. We achieve an exponential decay in parameters 
di, d 2 , d 3 . 

We shall rely repeatedly on the estimates 

(5.12) E P( R ) 2 ~ 2 2(n+1)dl 5 2 _ 1 , 

R&A dl 

(5.13) E P{R') 2 <2 2nd \ Jc{l,2,..,nUeN |J| . 

The first of these has the critical gain by a factor of S 2 _ 1 , as follows from (5.6). The second estimate 
follows from the fact that b is in BMO and that the rectangles R' in y(J, d 2 , d 3 ) are contained in 
{M l s h(w) > c2~ nd ' 2 } since R' C 2 d2+4 R for some R £ Ud x - Here M is the strong maximal function. 
At this point we recap the notations. 

• d -l = II & IIbA/0_i(®7C + )- 

• d\ is associated to the measure of embeddedness of rectangles R e U. 

• d 2 is a (crude) measure of the separation between the rectangles R' £ W and R G U c i x for 
d 2 >d\. 

• J C {1, 2, . . . , n} is that set of coordinates for which one has some orthogonality. 

• I e Zl J l specifies the side lengths of R! for those coordinates j <E J. 

• ds G N' J ' specifies how much bigger R is than R' in the coordinates j £ J. 

• (R',tt(R')) G -^(J,d 2 ,T,^) and |7r(iJ')j| = 2^|i?^| for 7 G J, otherwise for j J, 

. /3(E) :=K6,w«>|. 



The Bounds for the Paraproducts. The argument varies depending upon the cardinality of 
J C {1, 2, . . . , n}. While we can formalize issues in a way that is uniform with respect to | J|, we 
present four subsections, to emphasize the differences that come about due to the increasing number 
of parameters. 

The Case of J = {1, 2, . . . , n}. The key point is that the sum in (5.11) simplifies considerably as all 
the side lengths of R' are specified by the parameter I . In particular, the rectangles R! occurring 



in 
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in the sum in (5.10) are pairwise disjoint. Thus, the L 2 norm in (5.11) will simplify to 



2-11*11 ^ 



li?' 

R'&y(.id 2 , i ,d 3 ) 
(3{R') 2 



R' 

R'ey(J,d 2 , i ,d 3 ) 

As b has BMO norm one, the supremum above is bounded by 1. Then sum over £ <G Z'" 7 ' and use 
(5.12) to see that 

||Y(J,d2,<fe)||2 < S 2 ^ 3 ^--^. 
Recall that we can tolerate a few positive powers of g?2- This case is complete. 

The Case of J = {1, 2, . . . , n — 1}. Now, the rectangles R' G y(J,d%, £ , (I3) are only permitted 
to vary in the last coordinate. That is, the corresponding sums are as complex as those of one 
parameter Carleson measures. So we can explicitly compute 

lY W *,?,S)| S - r i»i £ 

R'e¥(j,d 2 ,7,S) 



£ f3(R")/3(ir(R")). 



ft"e¥(J,d 2 , £ ,d 3 ) 
K"Cfl' 

With the specific way the innermost sum is formed, observe that 

E /?(£") 2 <£-iin 

fl"£Y(J,d 2 ,"?,*) 
R"CR' 

J2 P«R")? ^ 2"' l2+ ll^ll ( 5 2 _ 1 |i?'|. 

R" eY(J,d 2 ,~t Xs) 
R"CR' 

The first estimate is obvious, while the second estimate follows from the fact that the rectangles 
7r(i?') are contained in 

g)n =1 2*+£*J2j. 

In this last display, set the last coordinate of d% to be zero. Applying these observations, Cauchy- 
Schwarz, (5.13), and ||fo||sMO(®fC + ) = 1 we see that 

\\w,<h,t)\\i< si^-i^h 1 ^ J2 m'm^R 1 )) 

R'eY(JM 2 ,~t,d~t) 

< S 2 _ 1 2^ nd2 ~^^ . 

This completes this case. 
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The Case of < \J\ < n — 1. The argument in this case could be adapted to treat the general 
case. We would like to indicate the additional difficulty that one faces in this case. The side 
lengths of R' are fixed for those coordinates in J, and completely specified by £ £ Z' J \. The 
remaining side lengths of R' are then permitted to vary. Thus, the ways that two possible choices 
of R', R" S y(J, g?2 , £ , d^) can intersect are as general as the intersections of two dyadic rectangles 
of dimension n — \J\. 

Nevertheless, one can implement a method of proof that follows the lines of the case J = 
{f,2, . . . , n— 1}, provided one takes advantage of the John-Nirenberg inequality, which we now state 
in the form used. For rectangles R £ 1Z and non-negative constants clr for which X)i?cw a R — 1^1 
for all open sets W C R™, we have 



E 

RCW 



an 
\R\ 



h 



< 



\W\ 



i/p 



1 < p < 00. 



We use this to obtain the following extensions of the inequalities (5.12) and (5.13). In the first place, 
we have 



(5-14) 



E 

L -R'e> ) (./,d2 ,da) 



\R'\ 



1/2 



1 < p < 00. 



This is available to us from the fact that b is in BMO with norm one. A similar fact is 

1 1/2 



(5.15) 



L -R'ey(J,d 2 ,d 3 ) 



E 



R' 



< 2 



2nd--> 



1 < p < OO. 



The important features of these estimates are that they are independent of ^3, uniform in £ G Z'' 7 ', 
and in the second estimate we have the gain of 8—1. 

Now, the second estimate does not immediately follow from a BMO estimate, due to the fact 
that we have a mismatch between R' and 7r(i?') in (5.15). Due to the John-Nirenberg inequality, 
(5.15) will follow from the estimate 

53 PWR')) 2 <2 And2+ ^8 2 _ 1 \W\, WcR n . 

R'CW _ 

R'&y(j,d 2 X) 

As this estimate is uniform in the choice of W, it provides a bound for a Carleson measure to which 
the John-Nirenberg inequality applies. For a given W C K n , it is the case that for all rectangles 
R' that contribute to this sum, the rectangle tt(R') is contained in a set which is given in the first 
place by a strong maximal function applied to W. Set 



{Miff > c2~ nd2 }, 



for an appropriate choice of c. This set, so constructed, will contain a translation of R! which is 
contained in ir(R'). The point to keep in mind is that 7t(R') is 2 d3j times longer than R' in the 
coordinate j € J. Thus, in that coordinate, we should apply a one dimensional maximal function 
with threshold 2~ d3j . Namely, for j £ J = {1, 2, . . . , | J|}, we inductively define 



{M, In <-2 ^ \. 



For appropriate constant c, we will have n(R') C W\j\. And we certainly have \ W\ j\ \ < 2^ d3 ^ +2nd2 \ W\. 
This completes the proof of (5.15). 
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Estimates (5.14) and (5.15) are not in themselves enough to complete the proof, as there is no 
decay in the quantity ||d3|| . But, they do show that 



(5.16) 



< 



[ Y(j,d 2 ,?,4 



1/2 



-I 1/2 



E 

L R'ey(J,d 2 X) 
< 2 4nd2 5_i. 



E 



P{<R')) 2 
\<R')\ 



1 1/2 



Namely, we have an estimate on the L 4 norm that is uniform with respect to d^. This will permit 
us to select a set which decays with respect to this parameter. On this set, we will not attempt to 
estimate the L 2 norm in (5.11). The set we take is 

E:= (J {MY(J,d 2 ,t,^) >8- 1 2^}. 

Here, wc use the strong maximal function M. This set has measure \E\ < 2 16nd 2-^||d 3 |l 5 due t0 thc 
large L p norms we have in (5.16). 

To complete the argument in this case, it suffices to show that 

V f \Y{J,d 2 ,~l ,%)\ 2 £&<<5_i2 3 " d2 -*ll^ll. 



We will expand the square on the left hand side. Integrating Y(J, d 2 , £ ,d%) over R' — E, we will 
lose a factor of 2sll*H . But from |tt(.R')| = 2^ \R'\ we will gain a factor of 2^1^11 . Specifically, 



v / i¥(j,d 2 ,t,^)i 2 dx< E 



£ e z|J| R'ey{J,d 2 , i ,d 3 ) 

Y{J,d 2 ,~i ,ck) dx 



(3(R<) P(*(R')) 
VW\VWW)\ 



R'-E 

l l ^ 



< 5 _ l2 (l-l)ll*li J2 E P{R')P{<R')) 

< ^2 2( 2 ™+ 1 ) d a— ill^all 



This estimate follows from thc definition of the set E and (5.12) and (5.13). 



The Case of J — 0. In this case, both £ and <i3 are not present, and the rectangles R' and tv(R') 
have comparable lengths in all coordinates. But we do have (5.14) and (5.15), and they directly 
prove the desired estimate 



|Y(0,d 2 



< 



E 

lR'ey(0,d 2 ) 

ind 2 



P{R 



i\2 



\R' 



1/2 



E 

M'&y($,d 2 ) 



\<R')\ 



1 1/2 



And this completes this case. 
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6. Journe's Lemma 

We state a version of the Lemma of Journe [10] that is implicit in Pipher's extension [13], and 
interfaces well with our notion of a restricted BMO norm, namely -BMO_i(<E>"C+). We first state 
the lemma in a purely geometric fashion, and then return to a formulation that is more specific to 
our needs in this paper. 

The Geometric Formulation. Given a collection U of dyadic rectangles whose shadow has finite 
area, suppose that V D sh(U). For rectangles R EU, define 

emb(i?, V) := sup{^ > 1 | fxR 1 x R 2 x • • ■ x R n c V}. 

For an arbitrary subset W C U, let 

F(I, k,U') ■= \J{I x R' \I x R' eW, 2 k ~ 1 < emb(I x R', V) < 2 k }. 

Lemma 6.1. For all 5, e > 0, we can select V D sh(U) with \V\ < (1 + <5)|sh(Z-/)|, for which we have 
the uniform estimate 2 

oo 

(6.2) J2J2 2 ~ ek \ F ^' k ^\^\ sh ^% «'CW. 

k=i lev 

The implied constants in these estimates depend only on dimension and the choices ofe,5. 

Consider a collection of rectangles U in which all the first coordinates are the same. Then 
the embeddedness is necessarily of order 1. This example shows that the lemma above must be 
formulated in this fashion. 

We begin the proof with a careful description of how to select the set V. If we were not too 
concerned about the upper bound on the measure of V, in other words if the bound < |sh(W)| 
were enough, then we could simply take V — {M l s h(u) > 5}- F° r our needs, however, this choice 
of V is completely inappropriate. 

We need the notion of shifted dyadic grids, which is a modification of an observation due to 
M. Christ defined as follows. The definition of the grids depends upon a choice of integer d, and we 
will set 5 = (2 d + I) -1 . For integers < b < d, and a e {±(2 d + 1) _1 }> lct 

(6.3) V dAa := {2 fcd+b ((0,l) +j + (-l) k a) | k € Z, 3 £ Z}, 

d-l 
a b=0 

One checks that I?d,&.Q is a grid. Indeed, it suffices to assume a = (2 d + I)" 1 and that 6 = 0. 
Checking the grid structure can be done by induction. And it suffices to check that the intervals in 
f d,o,a of length one are a union of intervals in 2?d.o,a of length 2~ d . One need only check this for 
the interval (0, 1) + a. But certainly 

1 2 "- 1 ■ 1 



(°' i )+(FTi) = y n (o ' 2 " )+ ^ + (^Ti) 



3=0 

3+1 1 



|J(0,2- d ) 



2 d 2 d (2 d + l)' 

3=0 y ^ 1 



We have stated the lemma in the formulation for the first coordinate to ease the burden of notation. In application, 
we will use this in an arbitrary choice of coordinate. 
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What is more important concerns the collections T>^. For each dyadic interval I £ T>, I ± S\I\ € 
T>d- 3 Moreover, the maximal function M :Dd maps L 1 into L 1,0 ° 4 with norm at most 2d ~ | log <5 1 . In 
fact we need the finer estimate, valid for all choices of < S < 1 and integers d, 

\{M V " It; > 1 - S}\ < (1 + KSd)\U\ 

for all subsets U of the real line of finite measure and some constant K. This will be an effective 
estimate since the value of d we will consider is d ~ \— log 2 5~\ . To see this estimate, note that 

d-l 

|{M 27d l [/ >l-<5}|<|[/|+^ \U C D {M v "-^ l v > 1 - S}\ 

fc=0a6{±(2 d + l)- 1 } 

<(l + 2d[(l-5)- 1 -l])|C/|. 

The main line of the argument can now begin. We take 5 = (l + 2 d ) _1 for an integer d. We use the 
maximal functions M x ' d , but only in the last step of the induction. Initialize Enl(?i + l,W') := sh(W), 
so that we use backwards induction. Inductively define 

Enl(j,W) := {Mf > 1 - 6*}, n>j> 2, 

(6-4) x 

V := {Mf d l Enl(2>w0 > 1 - -}, 

where the subscript on the maximal functions denotes the coordinate in which the maximal function 
is applied. Then it is the case that \V\ < (1 + A'(5|log<5|)|sh(W')|, where the constant K depends 
only on the dimension n. 

Now pass to a further subset U" C W such that for all R, R' e U" , we have 2 k ~ 1 < emb(i?, V) < 
2 k and if it is the case that \R[ \ < \Ri\, then we have the stronger inequality 40 • 2 fe <5~ 1 |i?' 1 | < \Ri\. 
We term this assumption "separation of scales" in the first coordinate. Under these assumptions, 
estimate (6.2) reduces to 

(6.5) J2\F(I,k,U")\ < |sh(W)|, W <ZU. 

lev 

Sufficiency is seen by noting that obtaining separation of scales necessitates dividing the rectangles 
into approximately log2 fe <5 -1 subclasses. Then multiplying by 2~ ek , one is able to sum over all 
scales k. 

Our strategy is to define, for dyadic intervals /, sets H{I) that are disjoint in /, contained in 
sh(W'), and for which 

^F(I,k,U") ^ M 1-HX-O 

for an appropriate maximal function M, where the implied constant is permitted to depend upon 
(5 and dimension n. It will in fact be of the order 0(S~ 2 ). An appeal to the Fefferman-Stcin 
maximal inequalities [5] will then prove (6.5). 

The sets H(I) are defined to be F(I, k,U") - G(I), where 



G(I):= IJ F(I',k,U"). 



I'ev 

By our separation of scales, the minimal dyadic interval / that contributes to this union contains / 
and satisfies 

40 • 2 k S~ 1 \I\ < \I\ < 80 • 2 k 5~ 1 \I\. 



^The problem we are avoiding here is that the dyadic grid distinguishes dyadic rational points. At the point for 
instance, ((0, 1) — 8) <£_ (0, 2 fc ) for all integers k, regardless of how big k is. 

4 In fact, taking d = 1, it is routine to check that Jlf" 1 dominates an absolute multiple of the usual maximal 
function, thus, proving that it satisfies the weak type inequality. 
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We need to show that if R is a dyadic rectangle with \RnG(I)\ > (l-5 2 " +1 )|i?|, then emb(R, V) > 
2 k+1 , and hence it can't be among those rectangles that contribute to F(I,k,U"). 5 This will be 
accomplished by the following device. We will show that 

(6.6) \Ix R 2 x • ■ ■ x R n n Enl(2,W')| > (1 - 5 2 ' 2 )\I x R 2 x • ■ ■ x R n \. 

As I ± ||J| € T>d, and this is the grid we use in the final step in the construction of V, we conclude 
that the rectangle I X R% X • • • X R n is inside of V. Even S\I\ > 20 • 2 k \I\, therefore we see that 
emb(R,V) > 2 k+1 , as desired. 

We turn to the proof of (6.6). The sequence of powers of S that appear in the definition of V, 
(6.4), is explained in part by the next proposition. 

Proposition 6.7. Let < X < 1 be a random variable on a probability space satisfying EX = 1 — 77. 
Then, 

P(X < 1 - yfq) < V^. 
PROOF. Setting p = P(X < 1 - y/fj), the inequality 

1 - -q < EX < (1 - ^q)p +l-p 
will prove the proposition. □ 

We continue with the language of probability. Let (f2j,Pj) be the probability spaces 

fii := I 

Qj := 7 x i? 2 x ■ ■ • x Rj, 2<j<n, 

and let Fj be normalized Lebesguc measure on flj. The first of the relevant sequence of random 
variables on these spaces is 

\({x}xR n )nG(I)\ 
X n -i(x) := -— , x£ll n -i, 

A„_i := {x- I X„_i(.t) > 1-5 2 "}. 

Since |i?nG(/)| > (1 - c5 2 " +1 )|i?|, EX„_i > l-5 2 " + \ and applying the proposition, P„_i(^4 n _i) > 
1 — S 2 . Continuing by reverse induction, define for n — 1 > j ' > 2 

X^):= l{iX}X « j } nAjl , 



Aj-i :={x\X^ l {x)>l-5 2 }. 

Induction gives us the conclusion that Pi (Ai) > 1— 5 22 . This implies (6.6) by inspection of definitions 
and so completes the proof. 

A Second Geometric Formulation. We need a certain variant of the previous lemma. Given a 
set of rectangles U, we let cmb(-) : U — > [1, 00) be a map from U to the reals greater than one. And 
we take 1 : U — * {1, 2, . . . , n} which is simply a choice of coordinates. Based on these two data, for 
any subset U' C U we set 

(6.8) F(I,k,m,U') = \J{R e W \ 2 k < emb(R) < 2 k+1 , i(R) = m, R m = I}. 



^If we could use the strong maximal function to define embeddedness, this conclusion would be immediate. Our 
more subtle definition of embeddedness appears to force the more complicated argument that follows. 
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Lemma 6.9. Fix (5, e > 0. For any collection of rectangles U with finite shadow, we can select 
V D sh(U), and data cmb(-) and i so that the following conditions hold. 

\V\<{l + 8)\ S h{U% 
emb{R)R C V, ReU, 

CO n 

k=0 m=l lev 

The implied constant in the last line depends only on S, e and dimension. 

The essential points for us are that the set V is not much larger than the shadow of U, and that 
the rectangles R G IA, after a dilation uniform in all coordinates by the embcddedncss quantity, is 
contained in the enlarged set V. We find that the embcddedncss quantity in the last line requires 
a large negative power, but that is a completely harmless fact in the context of the application we 
have in mind. 

Again, the fact that the dyadic intervals distinguish certain points causes some difficulties for 
us, and we appeal to the shifted dyadic intervals (6.3) of the previous subsection, though our needs 
are not so refined in the current context. Let S be the union of the dyadic intervals with the two 
collections T>± := T> 10 ±i. For any interval / of the real line, we can find an interval J G S with 
I C J C 47. Indeed, let J' be the maximal dyadic interval contained in 4/ with \I fl J'\ > h\I\- If 
/ C J' we are done, so assume that this is not the case. We necessarily have \J'\ > ||/|, so that 
one of the two intervals J' ± || J'\ contains /. Both of these intervals are in S, so we are done. 

The method of proof requires that we apply Lemma 6.1, although we find it necessary to apply 
it both inductively and to a wide range of possible collections of rectangles. In fact, it is useful to 
us that this lemma applies not just to collections of a subset U C <8>™ =1 S such that the shadow of U 
is of finite measure. It also applies to all possible subsets of Li. 

We apply Lemma 6.1 to U° := U. Thus, we get a set V 1 D sh(U°), with < (1 + 5)\sh(U°)\, 
so that for 

emb^i?, V^ 1 ) := sup{^ > 1 | fJ,Ri X R 2 X • • • X Rn C V 1 }. 
we have the conclusion of Lemma 6.1 holding. We then construct U 1 C <8>" =1 6>. Set 

U 1 := {7 x ®] =2 Rj \R€U, 7 G S, 

(i?i Uiemb 1 ^^ 1 )^) C 7 C emb^-R.V 1 ).^}. 

The inductive stage of the construction is this. For 2 < m < n, given lA m ~ x C (g)" =1 <S, we apply 
Lemma 6.1 to get a set V m satisfying 

V m D shiU" 1 - 1 ), \V m \ < (1 + S)\sh(U m - 1 )\. 

The embedding function for rectangles R G U" 1 ^ 1 is 

emb m (i?, V rn ) := sup{n > 1 \R X X • • • X i? m _i x ^iR m 

x R m+1 x • • • x R n C V m }. 

And the conclusion of (6.2) holds. The collection U m is then taken to consist of all rectangles of the 
form 

®7=iR 3 x 7 x ®] =m+2 Rj 

where R G W m_1 and 7 G S satisfies 

(R m U i cmb m (R, V m )R m ) c 7 C emb m (R, V m )R m . 
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To prove Lemma 6.9, we take V := V n . It is the case that 



\v n \ < (l + ^ish^™- 1 )! 

< (i + ^i^™- 1 ! 

< (1 + S) n \sh(U)\. 



The definition of the embedding function is not so straight forward. It is taken to be 



emb(i?) = inf f3 m {R) 



where (3 m (-) are inductively defined below. The function i(R) is taken to be the coordinate in which 
the infimum for the embedding function is achieved. 

Set P 1 (R) := emb 1 (R,V 1 ). In the inductive step, for 2 < m < n, set 7 m (i?) := inf J<m /3 J (ii). 
For 1 < 7 < 7 m (i?), let 



where (f™(R) £ U m 1 is the rectangle with <p™(i?)j = Rj for j > m, and for 1 < j < m, tp™(R)j is 
the element of S of maximal length such that 



Now, take 7 to be the largest value of 1 < 7 < 7, n (i?) for which we have the inequality > 7. 

Let us see that this definition of 7 makes sense. This last inequality is strict for 7=1, and as 7 
increases, @™(R) decreases, so 7 is a well defined quantity. Then define /3 m (i?) :— f3^{R), and for 
our use below, set (p m (R) := (p^(R). 

The choices above prove our lemma, as we show now. For each rectangle R € U, it is clear that 
cmb(i?)i? C V. Take U' C U. If we consider the sets F(I, k, m,W) as in (6.8), then, by Lemma 6.1 
applied in the mth coordinate, 



While we have a very good estimate for the shadow of ip m (U), a corresponding good estimate for 
an arbitrary subset W seems very difficult to obtain. But it is a consequence of our construction 
that the rectangle (p m (R) is a rectangle which agrees with R in the coordinates j > m and, for 
coordinates 1 < j < m, is expanded by at most 32emb(i?) < 2 fe+6 . Hence, we have the estimate 



This follows from the weak L bound for the maximal function in one dimension, applied in each 
coordinate separately. It is in this last step that we lose the large power of the embeddedness. Our 
proof is complete. 

The £>MO_i(<g>™C+) Formulation. The form in which we apply the previous lemma is this. 

Lemma 6.10. Given a function b with finite BMO_i(8iC + ) norm and a collection of rectangles 
U whose shadow has finite measure, the following construction is possible. For all e, 5 > 0, there is 
a set V D sh(U) with \V\ < (1 + <5)|sh(W)|. To each R GW, there is a quantity cmb(i?) > 1 so that 



:=emb m ^Z L (R),V m ) 



{R ] yj\ 1 R 3 )c^p m {R) J ^ 1 R ] . 



J2\F(I,k,m,U')\ <¥ k \ S h(<p m (U'))\. 



lev 



|J{^ m (i?) \ReU'} < 2 nk \sh{U')\. 



emb(R)R C V, 



Reu, 



emb(R) 



(n+e) 



(b, v R )v R 



< Ks,e\\b\\BMO- 1 (®?C + )- 



Reu 



BMO(®?C + ) 
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For the proof, we apply Lemma 6.9. To check the conclusion of the lemma, wc take a subset 
W C U consisting of rectangles with 2 k < emb(i?) < 2 fe+1 . We then have 

n 

E KM*>I 2 ^ II & IIIa./ _ i( ^ c+ ) E £l*V,*,ro,«')l 

< IHI W l( ^c + )2 ( " +e)fc |sh(W')|. 
This is all we need to prove Lemma 6.10. 
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